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ABSTRACT 



We estimate the effeets of Wal-Mart stores on eounty-level retail employment and earnings, aeeounting 
for endogeneity of the loeation and timing of Wal-Mart openings that most likely biases the evidenee 
against finding adverse effeets of Wal-Mart stores. We address the endogeneity problem using a natural 
instrumental variables approaeh that arises from the geographie and time pattern of the opening of 
Wal-Mart stores, which slowly spread out from the first stores in Arkansas. The employment results 
indieate that a Wal-Mart store opening reduees eounty-level retail employment by about 150 workers, 
implying that eaeh Wal-Mart worker replaees approximately 1.4 retail workers. This represents a 
2.7 pereent reduetion in average retail employment. The payroll results indieate that Wal-Mart store 
openings lead to deelines in eounty-level retail earnings of about $1.4 million, or 1.5 pereent. Of eourse, 
these effeets oeeurred against a baekdrop of rising retail employment, and only imply lower retail 
employment growth than would have oeeurred absent the effeets of Wal-Mart. 



David Neumark 
Department of Eeonomies 
3151 Soeial Seienee Plaza 
University of California, Irvine 
Irvine, CA 92697-5100 
and NBER 
dneumark@ uci.edu 



Stephen Ciccarella 
Publie Poliey Institute of Califomia 
500 Washington Street, Suite 800 
San Franeiseo, CA 94111 
eieearella @ ppie . org 



Junfu Zhang 

Department of Eeonomies 
Clark University 
Worcester, MA 01610 
juzhang@clarku.edu 




I. Introduetion 



Wal-Mart is more than just another large eompany. It is the largest eoiporation in the world, with 
total revenues of $285 billion in 2005. It employs over 1.2 million workers in the United States, at about 
3,600 stores. 1 To put this in perspeetive, the Wal-Mart workforce represents just under 1 pereent of total 
employment and just under 10 pereent of retail employment in the United States. It exceeds the nurnber 
of high sehool teaehers or middle sehool teaehers, and is just under the size of the elementary sehool 
teaeher workforce. Wal-Mart is reported to be the nation’s largest groeer, with a 19 pereent market share, 
and its third-largest pharmaey, with a 16 pereent market share [6]. 

During the past two deeades, as Wal-Mart shaiply expanded its nurnber of stores in the United 
States, it inereasingly eneountered resistanee frorn loeal eommunities. Opponents of Wal-Mart have tried 
to bloek its entry on many grounds, ineluding the prevention of urban sprawl, preservation of historieal 
eulture, proteetion of the environment and “main-stteet” merehants, and avoidanee of road eongestion. 2 
Yet two of the most eommonly-heard eritieisms are that Wal-Mart eliminates more retail jobs than it 
ereates for a eommunity, and that it results in lower wages, espeeially in retail. 3 Wal-Mart executives 
dispute these elairns, espeeially with regard to employment. For example: its Viee President Bob 
MeAdam has argued that there are many loeations where Wal-Mart ereates jobs in other businesses in 
addition to what Wal-Mart itself offers [37]; the Wal-Mart web-site Walmaitfacts.com trumpets the 
positive effeets of Wal-Mart stores on retail jobs in the eommunities where stores open; 4 and an 
advertisement run in the USA Today, The Wall Street Journal, and The New York Times on January 14, 
2005, displayed an open letter from Lee Seott, Wal-Mart President and CEO, stating “This year, we plan 
to ereate rnore than 100,000 new jobs in the United States.” 5 Of eourse Wal-Mart offers other potential 
benefits in the form of lower priees for eonsumers ([3], [18]). 



1 See http://www.walmartfacts.eom/newsdesk/wal-mart-fact-sheets.aspx#al25 (as of September 8, 2005). 

2 See, for example, [7], [23], [24], [27], [34], and [40]. 

3 See, for example, [36], [39], and [48]. 

4 See http://www.walmartfacts.com/newsdesk/wal-mart-fact-sheets.aspx (as of Deeember 15, 2005). 

5 If this refers to gross rather than net job ereation, it eould be eonsistent with Wal-Mart destroying more jobs than it 
ereates. 
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In this paper, we seek to provide a definitive answer regarding whether Wal-Mart ereates or 
eliminates jobs in the retail seetor, relative to what would have happened absent Wal-Mart’s entry. Also, 
beeause of eoneern over the effeets of Wal-Mart on wages, and beeause polieymakers may be interested 
in the irnpaet of Wal-Mart on taxable payrolls, we also estimate the effeets of Wal-Mart on earnings in the 
retail seetor, refleeting the eombination of influenees on employment, wages, and hours. We believe that 
our evidenee improves substantially on existing studies of these and related questions, most importantly 
by implementing an identifieation strategy that aeeounts for the endogeneity of store loeation and timing 
and how these may be eorrelated with future ehanges in eai'nings or employment. Indeed, it has been 
suggested that Wal-Mart’s explicit strategy was to loeate in small towns where the population growth was 
inereasing [44, p. 92], and it is reasonable to expect that Wal-Mait entered markets where projeeted retail 
growth was strong. If Wal-Mait tends to enter fast-growing areas in booming periods, then we rnight 
expect to observe employment and earnings rising in apparent response to Wal-Mait’s entry, even if the 
stores aetually have negative effeets. 

Our identifieation strategy is driven by a systematie pattern in the openings of Wal-Mait stores. 

Sam Walton, the founder of Wal-Mait, opened the first Wal-Mait store in 1962 in Rogers, Arkansas, in 

Benton County. Five years later, Wal-Mait had 18 stores with $9 million of annual sales. Wal-Mait first 

grew into a loeal ehain store in the northwest pait of Arkansas. It then spread to adjaeent states sueh as 

Oklahoma, Missouri, and Louisiana. Frorn there, it kept expanding to the rest of the eountry after eloser 

markets were largely saturated [44, pp. 28-29]. The relationship between Wal-Mart stores’ opening dates 

and their distanee to the headquaiters is primarily a result of Wal-Mart’s “saturation” sti'ategy for growth, 

which was based on eontrol and distribution of stores, as well as word-of-mouth advertising. In his 

autobiography, Sam Walton deseribes the eontrol and distribution rnotive as follows: 

“[Our growth strategy] was to saturate a market area by spreading out, then filling in. In the eaidy 
growth years of diseounting, a lot of national eompanies with distribution systems akeady in 
plaee — Kmart, for example — were growing by stieking stores all over the eountry. Obviously, 
we eouldn’t support anything like that. . . . We figured we had to build our stores so that our 
distribution eenters, or warchouscs, eould take eare of them, but also so those stores eould be 
eontrolled. We wanted them within reaeh of our dishiet managers, and of ourselves here in 
Bentonville, so we eould get out there and look after them. Eaeh store had to be within a day’s 
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drive of a distribution eenter. So we would go as far as we eould from a warehouse and put in a 
store. Then we would fill in the map of that territory, state by state, eounty seat by eounty seat, 
until we had saturated that market area. . . . So for the rnost part, we just started repeating what 
worked, stamping out stores eookie-eutter style” [49, pp. 110-111]. 

One rnight wonder whether this need to be near a distribution eenter requires a steady spreading 
out frorn Arkansas. Why not, for example, open distribution eenters further away, and build stores near 
thern? The explanation seems to lie in the word-of-mouth advertising advantage pereeived to result from 
the growth strategy Wal-Mart pursued: 

“This saturation strategy had all sorts of benefits beyond eontrol and disti ibution. Frorn the very 
beginning, we never believed in spending mueh money on advertising, and saturation helped us to 
save a fortune in that department. When you rnove like we did frorn town to town in these rnostly 
rural areas, word of mouth gets your message out to eustomers pretty quickly without mueh 
advertising. When we had seventy-five stores in Arkansas, seventy-five in Missouri, eighty in 
Oklahoma, whatever, people knew who we were, and everybody except the merehants who 
weren’t diseounting looked forward to our eoming to their town. By doing it this way, we usually 
eould get by with distributing just one advertising eireular a month instead of running a whole lot 
of newspaper advertising” [49, p. 1 1 1]. 6 

Wal-Mart’s praetiee of growing by “spreading out” geographieally means that distanee frorn 
Benton County, Arkansas, and time — and rnore speeifieally their interaetion — help prediet when and 
where stores opened. 7 Thus, the key innovation in this paper is to instrument for the opening of Wal-Mart 
stores with interaetions between time and the distanee between Wal-Mart host eounties and Benton 
County, Arkansas, where Wal-Mart hcadquartcrs are loeated and the first Wal-Mart store opened. 

II. Literature Review 

There are a nurnber of studies that address elaims about Wal-Mart's impaets on loeal labor 
markets, emphasizing the retail seetor. However, we regard mueh of this literature as uninformative 
about the eausal impaet of Wal-Mait on retail employment and earnings. Sorne of the existing work is by 
advoeates for one side or the other in loeal politieal disputes regarding Wal-Mait’s entry into a partieular 
market. These studies are often hastily prepared, plagued by flawed methods and arbitrary assumptions, 



6 See Holmes [22] for an explanation of Wal-Mart’s expansion pattern based on the eost savings of loeating stores 
elose to one another. He ealls sueh savings the “eeonomies of density” that are derived from not just the word-of- 
mouth advertising and shared distribution system. For example, opening new stores near existing ones also makes it 
eost-effeetive for Wal-Mart to hire and train employees at existing stores and then transfer them to nearby new 
stores onee they open. 

7 For example, Wal-Mart’s expansion did not reaeh California until 1990. It first entered New England in 1991. In 
1995 Wal-Mart opened its first store in Vermont and finally had a presenee in all 48 eontiguous states. 
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and sponsored by interested parties sueh as Wal-Mart itself, its eompetitors, or union groups (e.g., [5], 
[13], and [41]), and ean hardly be expected to provide impartial evidenee on Wal-Mart’s effeets. Henee, 
they are not summarized here. 

There is also an aeademie literature on the impaet of Wal-Mart stores, foeusing on the effeets of 
Wal-Mart openings on loeal employment, retail priees and sales, poverty rates, and the eoneentration of 
the retailing industry, as well as the impaet on existing businesses. This researeh is limited by three main 
faetors: the restrietion of rnueh of it to srnall regions (often a single small state); its laek of foeus on 
employment and earnings effeets; and its failure to aeeount for the endogeneity of Wal-Mart loeations, 
either at all or, in our view, adequately. 

Many of these studies, espeeially the early ones, foeus on the effeets of Wal-Mart at the regional 
level, spumed by the expansion of Wal-Mart into a partieular region. Most of these studies foeus on the 
effeets of Wal-Mart on retail businesses and sales, rather than on employment and earnings. The earliest 
study, which is typieal of mueh of the researeh that has followed, is by Stone [44]. He defines the “pull 
faetor” for a speeifie merehandise eategory as the ratio of per eapita sales in a town to the per eapita sales 
at the state level, and examines the ehanges in the pull faetor for different merehandise eategories in host 
and suiTounding towns in Iowa after the opening of Wal-Mart stores. Stone finds that, in host towns, pull 
faetors for total sales and general merehandise (to which all Wal-Mart sales belong) rise after the arrival 
of Wal-Mart. Pull faetors for eating and drinking and home furnishing also go up beeause Wal-Mart 
brings in more eustomers. However, pull faetors for groeery, building materials, apparel, and speeialty 
stores deeline, presumably due to direet eompetition from Wal-Mart. He also finds that small towns 
surrounding Wal-Mart towns suffer a larger loss in total sales eompared to towns that are further away. 8 
Related results for other regions — which generally, although not always, point to similar eonelusions — 
are reported in [1], [2], [9], and [28]. All of these studies use administi ative data, and employ researeh 



8 Stone’s study was updated regularly (see, for example, [45], [46]), but its eentral message remained the same: Wal- 
Mart pulls more eustomers to the host town, hurts its loeal eompetitors, but benefits some other loeal businesses that 
do not direetly eompete with it. Using the same methods, Stone et al. [47] show similar results regarding the effeets 
of Wal-Mart Supereenters on existing businesses in Mississippi. 
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designs based on before-and-after eompai'isons in loeations in which Wal-Mart stores did and did not 
open. 9 

The studies reviewed thus far do not address the potential endogeneity of the loeation and timing 
of Wal-Mart’s enti'y into a partieular market. In addition, these studies do not foeus on the key questions 
with which this paper is eoneerned — the effeets of Wal-Mart on retail employment and earnings. A few 
studies eome eloser to the mark. Ketehum and Hughes [29], studying eounties in Maine, recognize the 
problem of the endogenous loeation of Wal-Mart stores in faster-growing regions. They attempt to 
estimate the effeets of Wal-Mart on employment and earnings using a differenee-in-differenee-in- 
differenees (DDD) estimator that eompares ehanges in retail employment and earnings over tirne in 
eounties in which Wal-Mart stores did and did not loeate, eompared to ehanges for manufaeturing and 
serviees. However, virtually none of their estimated ehanges are statistieally signifieant and the data 
appear very noisy, so the results are generally uninformative. More important, their approaeh does not 
address the key endogeneity questions of whether Wal-Mait loeation deeisions were based on antieipated 
ehanges after stores opened, or instead only prior trends that were akeady different (despite the authors 
posing these questions). 10 Hieks and Wilburn [20], studying the impaet of Wal-Mart openings in West 
Virginia, estimate positive impaets of Wal-Mait stores on retail employment and the number of retail 
firms. They do not explicitly aeeount for endogeneity, although they do address the issue. In partieular, 
they report evidenee suggesting that Wal-Mait loeation deeisions are independent of long-term eeonomie 
growth rates of individual eounties in their sample, and that eurrent and lagged growth have no signifieant 
effeet on Wal-Mait’s deeision to enter. However, these results do not explicitly address endogeneity with 



9 A eouple of studies rely on surveys of loeal businesses rather than administrative data. MeGee [32] reports results 
from a small-seale survey of small retailers in five Nebraska eommunities eondueted soon after Wal-Mart stores 
entered. He finds that 53 pereent of the responding retailers reported negative effeets of Wal-Mart’s arrival on their 
revenues while 19 pereent indieated positive effeets. In a survey of Nebraska and Kansas retailers, Peterson and 
MeGee [38] fmd that less than a third of the businesses with at least $1 million in annual sales reported a negative 
effeet after Wal-Mart’s arrival, while elose to one half of the businesses with less than $1 million in annual sales 
indieated a negative effeet, with negative effeets most eommonly reported by small retailers in eentral business 
distriets. The researeh design in these surveys fails to inelude a eontrol group eapturing ehanges that might have 
oeeurred independently of Wal-Mart openings. In addition, reported assessments by retailers may not refleet aetual 
effeets of these openings. 

10 The seeond question ean only be addressed via an instmmental variables approaeh, and the first requires looking 
at ehanges in growth rates, not ehanges in levels; their study only does the latter. 
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respeet to future growth. The latter, in paitieular, eould generate apparent positive impaets of Wal-Mart 
stores. 

In more reeent work, Basker [4] studies the effeets of Wal-Mart on retail employment using 
nationwide data. Basker attempts to aeeount explicitly for endogeneity by instrumenting for the aetual 
nurnber of stores opening in a eounty in a given year with the planned nurnber. The latter is based on 
numbers that Wal-Mart assigns to stores when they are planned; aeeording to Basker, these store numbers 
indieate the order in which the openings were planned to oeeur. She then eombines these numbers with 
information from Wal-Mart Aiuutal Reports to measure planned and aetual openings in eaeh eounty and 
year; the data refleet some measurement error in store opening dates. 11 Her results indieate that eounty- 
level retail employment grows by about 100 in the year of Wal-Mart entry, but deelines to a gain of about 
50 jobs in five years as other retail establishments eontraet or elose. In the meantime, possibly beeause 
Wal-Mart streamlines its supply ehain, wholesale employment deelines by 20 jobs in the longer term. 12 

The prineipal problem with this identifieation strategy, however, is that the instrument is 
uneonvineing. For the instrument to be valid, two eonditions rnust hold. The first is that planned store 
openings should be eoiTelated with (predietive of) aetual openings; this eondition is not problematie. The 
seeond eondition is that the variation in planned openings generates exogenous variation in aetual 
openings that is uneoiTelated with the unobserved determinants of employment that endogenously affeet 
loeation deeisions. This seeond eondition holds if we assume, to quote Basker, that “the nurnber of 
planned Wal-Mart stores ... for eounty j and year t is independent of the error term . . . and planned Wal- 
Mart stores affeet retail employment per eapita only insofar as they are eorrelated with the aetual 
eonstruetion of Wai-Mart stores” [4, p. 178]. The seeond part of the assumption is potentially 
problematie; aetual stores should, of eourse, be the driving influenee, although planned stores — even if 

11 This is detailed in an appendix available from the authors. 

12 Using the same instrumental variables strategy, Basker [3] estimates the effeets of Wal-Mart entry on priees of 
eonsumer goods at the eity level, finding long-run deelines of 8-13 pereent in priees of several produets ineluding 
aspirin, detergent, Kleenex, and toothpaste, although it is less elear to us why loeation deeisions would be 
endogenous with respeet to priee. Ordinary least squares (OLS) estimation also finds long-run negative effeets of 
Wal-Mart on priees for nine out of ten produets, although only three are signifieant and all are smaller than the IV 
results. Hausman and Leibtag [18] study the effeets of Wal-Mart on food priees. 
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they do not materialize or do so only with delay — may still affeet deeisions of other businesses. 13 The 
first part of the assumption is a rnore serious eoneern, though, as it seems most likely that planned 
openings will refleet the same unobserved determinants that drive endogenous loeation as are refleeted in 
aetual openings, and we eannot think of an argument to the eontrary (nor does Basker offer one). In this 
ease if the OLS estimate of the effeet of Wal-Mart stores on retail employment is biased upward beeause 
of endogeneity, the instrumental variable (IV) estimate may also be upward biased, and rnore so than the 
OLS estinrate. 14 

Using various data sourees, Goetz and Swaminathan [14] study the relationship between Wal- 
Mart openings between 1987 and 1998 and eounty poverty rates in 1999, eonditional on 1989 poverty 
rates (as nreasured in the 1990 and 2000 Censuses of Population). They also use an IV proeedure to 
address the endogeneity of Wal-Mart entry, instrunrenting for Wal-Mart openings during 1987-1998 in an 
equation for eounty poverty rates in 1999. Their IVs inelude an unspeeified pull faetor, aeeess to 
interstate highways, earnings per worker, per eapita property tax, population density, pereentage of 
households with more than three vehieles, and nunrber of female-headed households. The results suggest 
that eounty poverty rates inerease when Wal-Mart stores open, perhaps beeause Wal-Mart lowers 
earnings (although the authors offer other explanations as well). However, why the IVs should affeet 
Wal-Mart openings only, and not ehanges in poverty direetly (eonditional on Wal-Mart openings), is not 
elear, and it is not hard to eonstruet stories in which invalid exclusion restrietions would ereate biases 
towards the finding that Wal-Mart openings inerease poverty. 15 



13 A telling example oeeurred in 2003, after Wal-Mart announeed plans to build 40 Supereenters in California. 
Supermarket ehains in southern California ineluding Albertsons, Ralphs, and Vons immediately sought to lower 
eosts and avoid being undereut by Wal-Mart, leading to four months of labor eonfliet, all before Wal-Mart opened 
its first Supereenter in California (in Mareh 2004). See, for example, [16] and [21]. 

14 See the diseussion in the longer working paper version of this study [35]. Basker also motivates the IV as 
eorreeting for bias from measurement error in the aetual opening dates of Wal-Mart stores in her data. But the same 
argument against the validity of planned openings as an instrument applies. 

15 For example, eonsider the use of the number of female-headed households as an IV, and suppose that this variable 
is positively eorrelated with ehanges in poverty rates (beeause of rising inequality over this period), and also 
positively eorrelated with Wal-Mart openings (beeause they loeate in lower-income areas). In this ease the IV 
estimate of the effeet of Wal-Mart openings on ehanges in poverty rates is biased upward beeause of the positive 
eorrelation between the instrument (female-headed households) and the error term in the equation for the ehange in 
the poverty rate. In addition, although the authors do not speeify how they eonstruet their pull faetor, we assume it 
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Our researeh addresses the four prineipal shorteomings of the existing researeh on the effeets of 
Wal-Mart on loeal labor markets. First, we estimate the effeets of Wal-Mart openings on retail 
employment (as well as earnings). Seeond, we have — we believe — a far more eonvineing strategy to 
aeeount for the potential endogeneity of Wal-Mart openings. Third, we are able to use administrative data 
on Wal-Mart openings that eliminate the measurement eiTor in reeent work. And finally, we use a data 
set that is national in seope. 

III. Data 

The analysis is done at the eounty level. Aside frorn following Basker’s strategy, this strikes us 
as a reasonable geographie level of disaggregation at which to deteet the effeets of Wal-Mart stores — not 
so small that many of the effeets may oeeur outside of the geographie unit, and not so large that the 
effeets may be undeteetable. 

Employment and payroll data are drawn frorn the U.S. Census Bureau’s County Business Patterns 
(CBP). 16 CBP is an annual series that provides eeonomie data by industry and eounty. The series 
ineludes most eeonomie aetivity, but excludes data on self-employed individuals, employees of private 
households, railroad employees, agrieultural produetion workers, and most government employees. 
Payroll in the CBP ineludes salaries, wages, reported tips, eommissions, bonuses, vaeation allowances, 
siek-leave pay, employee eontributions to qualified pension plans, and taxable fringe benefits, and is 
reported before deduetions for Soeial Seeurity, ineome tax, insuranee, ete. 17 It does not inelude profit or 
other eompensation earned by proprietors or business partners. Payroll is reported on an annual basis. 

The employment measure in the CBP data is a eount of jobs, rather than the number of people employed 
(in one or more jobs). Employment eovers all full- and part-time employees, ineluding offieers and 
executives, as of the pay period ineluding Mareh 12 of eaeh year. Workers on leave are ineluded, while 
proprietors and partners are not. 

is similar to the measure deseribed above — a ratio of eounty to statewide retail sales. Given that this is a dependent 
variable in other studies of the effeets of Wal-Mart, it is hard to justify using it as an IV for Wal-Mart openings. 

16 A good deseription of the CBP data is available at http://www.census.gov/epcd/cbp/view/cbpview.html (as of 
August 15, 2005). 

17 Given that it excludes non-taxable fringe benefits, of which the most important is health insuranee, we refer to the 
CBP measure as earnings, rather than eompensation. 




The most signifieant limitation of the CBP data for studying the effeets of Wal-Mart is that a 
wage eannot be eomputed. The CBP data do not provide a breakdown of employment into full-time and 
part-time workers, or any information on skill eomposition. Thus, we eannot tell whether ehanges in 
payrolls refleet ehanges in pay rates for eomparable workers or shifts in skill eomposition or hours. As a 
consequence, these data eannot be used to address questions of the effeets of Wal-Mart on wages. We 
ean, though, estimate Wal-Mart's effeet on total retail payrolls. 

We use CBP data frorn 1977 through 2002. We began with 1977 beeause CBP data are not 
eontinuously maehine-readable for the years 1964-1976, and ended with 2002 beeause that was the last 
year available. As explained below, however, rnost our analysis goes only through 1995 — a period for 
which our identifieation strategy is most eompelling. 

We study the retail seetor as a whole, as well as the general merehandising subseetor — which 
ineludes Wal-Mart and other general depai tment-style stores. 18 Looking at results for these different 
retail seetors is useful for assessing and understanding the results. For example, if Wal-Mart reduees 
retail employment, we rnight expect the employment reduetion to show up for the aggregate retail seetor, 
but to be offset by a sizable inerease in general merehandising. Sorne eomplieations arise in working with 
the CBP data beeause by federal law no data ean be published that rnight diselose the operations of an 
individual employer. As we look at rnore disaggregated subsets of industries, it is more likely that data 
are suppressed and so our sample beeomes smaller. We therefore eonstrueted two samples with which we 
ean eonsistently eompare at least some retail industry seetors for the same set of observations: 

• A sample : all eounty-year observations with eomplete (non-suppressed) employment and payroll 
data for aggregate retail, and in total. 

• B sample : all observations in the A sample that also have eomplete data for the general 
merehandising retail subseetor to which Wal-Mart belongs. 

18 General merehandising ineludes retail stores which sell a number of lines of merehandise, sueh as dry goods, 
apparel and aeeessories, furniture and home furnishings, small wares, hardware, and food. The stores ineluded in 
this group are known as department stores, variety stores, general merehandise stores, eatalog showrooms, 
warehouse elubs, and general stores. (See http://www.census.gov/epcd/ec97sic/def/G53.TXT, as of June 13, 2007). 
Based on 1997 Census data, general merehandising makes up about 2.2 pereent of all retail establishments, and 10.7 
pereent of all retail employment. 
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Beeause the rules for whether or not data are diselosed depend on the size of the retail seetor and 



the size distribution of establishments within it, sample seleetion is endogenous. We therefore emphasize 
results for the A sample, which ineludes nearly all eounties and years. We use the B sample only to 
eompare estimates for aggregate retail and general merehandising; as long as any biases frorn seleetion 
into the B sample are similar aeross retail subseetors, the estimates for aggregate retail and general 
merehandising ean still be meaningfully eompared. 19 

Wal-Mart provided us with administrative data on 3,066 Wal-Mart Diseount Stores and 
Supereenters. The data set eontains every Diseount Store and Supereenter still in operation in the United 
States at the end of fiseal year 2005 (January 31, 2005), as well as the opening date. 20 Beeause 
employment in the CBP data is measured as of early Mareh in eaeh year, we eode store openings as 
oeeundng in the first full ealendar year for which they are open. If we instead eoded the store opening as 
oeeundng in the previous ealendar year (sometime during which the store did in faet open), in most eases 
(unless the store opened before early Mareh) we would be using an employment level prior to the store 
opening as a measure of employment post-opening. 21 

After dropping stores in Alaska and Hawaii, we used 2,21 1 stores in our main analysis through 
1995, and 2,795 stores when we use the full sarnple period through 2002. By 2005, Wal-Mart also had 
551 Sam’s Club stores in the United States (the first opened in 1983), on which we also obtained data, 
although the data were less eomplete (for example, laeking information on square footage). We do most 



19 Basker [4] also foeuses on the retail subseetor excluding eating and drinking plaees and automotive dealers and 
gasoline serviee eenters — subseetors that are least likely to eompete direetly with Wal-Mart. However, eonstmeting 
data for this subseetor results in nearly two-thirds of eounty-year pairs being disearded beeause of non-diselosure, in 
which ease endogenous sample seleetion ean be severe. 

20 Unfortunately, we are often missing information on when Diseount Stores eonverted to Supereenters, which is 
frequent in the latter part of the sample. All we know is the eurrent store type. We also reeeived data identifying a 
small number of stores (54, as of 2005) that elosed. We return to this issue in some of our robustness analyses. This 
data set also indieated some store reloeations within eounties, which are treated as eontinuing stores beeause Wal- 
Mart replaeed smaller, older stores with laiger ones in nearby loeations. 

21 The measurement error eorresponding to stores opening in January through early Mareh of the previous year is 
less important in the IV estimation than in OLS estimation, beeause the IV estimation identifies the effeet from the 
predieted probability of store openings rather than aetual openings, and these predietions typieally will not vary 
mueh between adjaeent years. 
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of our analysis eonsidering the Wal-Mart stores other than Sam’s Clubs, but also some analysis 
ineorporating information on the latter. 22 

We eonstrueted a eounty-year file by eolleeting eounty names and FIPS eodes for the 3,141 U.S. 
eounties frorn the U.S. Census Bureau. 23 We ereated time-eonsistent geographieal areas aeeounting for 
merges or splits in eounties during the sample period. For eounties that split during the sample period we 
maintained the definition of the original eounty, and for eounties that merged during the sample period we 
ereated a single eorresponding eounty throughout. 24 This led to a file of 3,094 eounties over 19 years (26 
years when we use the full sample), to which we merge the CBP and Wal-Mart data. We assigned to the 
eounties population data for eaeh year from the U.S. Census Population Estimates Arehives. 25 Finally, we 
eompiled latitude and longitude data for eaeh eounty eentroid frorn the U.S. Census Bureau's Census 
2000 Gazetteer Files. 26 We eonstrueted distanee measures frorn eaeh eounty to Wal-Mart headquarters in 
Benton County, Arkansas, for reasons explained below, 27 ereating dummy variables (denoted DIST) for 
eounty eentroids in rings within a radius of 100 miles from Benton County, Arkansas, 101-200 miles, ete., 
out to the maximum radius of 1800 rniles (with the veetor of these dumrny variables denoted /9ASTbclow). 
IV. Empirieal Approaeh and identifieation 

We estimate models for ehanges in retail employment and payrolls. We generally eapture 
inereased exposure to Wal-Mart stores via a measure of store openings in a eounty-year eell — i.e., the 
ehange in the nurnber of stores. We define ehanges in employment, payrolls, and number of stores on a 
per person basis, to eliminate the undue influenee of a small number of large employment ehanges in 
extraordinarily large eounties. As long as we divide all of these ehanges by the number of persons in the 
eounty, the estimated eoeffieient on the Wal-Mart variable still measures the effeet of a Wal-Mart store 

22 Sam’s Clubs are different beeause eustomers have to beeome members, like at Costco. The srnall number of Wal- 
Mart Neighborhood Markets is not ineluded in any data we have, but the first one did not open until 1998, beyond 
the sample period used for most of our analysis. 

23 Downloaded from http://www.census.gov/datamap/fipslist/AHSt.txt (as of April 5, 2005). 

~ 4 The eode for ereating eonsistent eounties over time through 2000 was provided by Emek Basker, and 
supplemented by us. 

25 Downloaded from http://www.census.gov/popest/archives/ (as of April 5, 2005). 

26 Downloaded from http://www.census.gov/tiger/tms/gazetteer/county2k.txt (as of April 5, 2005). 

~ 7 We use the Haversine distanee formula for eomputing distanees on a sphere [42]. 
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opening on the ehange in the level of retail employment or earnings. To eontrol for overall ineome 
growth that may affeet the level of demand for retail, we inelude ehanges in total payrolls per person as a 
eontrol variable, to eapture eeonomie shoeks speeifie to eounties and years that eould eoineidentally be 
assoeiated with the distanee-time interaetions that make up the instrumental variable, sinee these shoeks 
likely affeet speeifie regions in speeifie periods. In addition, all models inelude fixed year effeets to 
aeeount for aggregate influenees on ehanges in retail employment or earnings that rnight be eorrelated 
with Wal-Mart openings, which oeeur with greater frequency later in the sample. 

We denote the eounty-level measures of retail employment and payrolls (per person) as 7, the 
nurnber of Wal-Mart stores (per person) as WM, total payrolls per person as TP, and year fixed effeets (in 
yeai' s) as YR S . Indexing by eounty j (j = and year t (t = 1 ,...,!), and defining a, /I y, and S s as 

sealar parameters, our baseline model for the ehange in the dependent variable for eaeh observation jt is: 

(1) AY jr =a + pAWM l ,+ ) ATPj,+f i S,rR,+ej,. 

5=1 

Fixed eounty differenees in the levels of the dependent variables drop out of the first-differeneed 
model. However, there may be systematie variation in these first differenees aeross different regions, 
eorresponding to faster or slower growth. We allow for this in a highly flexible manner by also 
estimating the first differenee models ineluding eounty fixed effeets (CO), which allows for a different 
linear trend for eaeh eounty, so the model beeomes 

d’) A7. ( =a + pAWM jt +yATP Jt +f^SJR s +j?<p i CO i +e Jt , 

5=1 Z=1 

where the <p, are also sealar parameters. 

Endogeneity ofWal-Mart Loeation Deeisions and identifieation 

Consistent estimation of equations (1) or (l’) requires that Sj t is uneorrelated with the right-hand- 
side variables. If Wal-Mart loeation deeisions are based in part on eontemporaneous and future ehanges 
in employment or payrolls, then this eondition eould be violated. This endogeneity is natural, sinee Wal- 
Mart would be expected to rnake loeation deeisions (ineluding the loeation and tirning of store openings) 
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based on eurrent eonditions and future prospeets, which might be related to both employment and payroll. 



As but one example, Wal-Mart may open stores where real estate development and zoning have reeently 
beeome favorable to retail growth. 

Our identifieation strategy in light of this potential endogeneity is based on the geographie pattern 
ofWal-Mart store openings over time. Figure 1 illustrates how Wal-Mart stores spread out 
geographieally throughout the United States, beginning in Arkansas as of 1965, expanding to Oklahoma, 
Missouri, and Louisiana by 1970, Tennessee, Kansas, Texas, and Mississippi by 1975, mueh of the South 
and the lower Midwest by 1985, more of the Southeastern seaboard, the plains, and the upper Midwest by 
1990, and then, in turn, the Northeast, West Coast, and Paeifie Northwest by 1995. After 1995, when the 
far eorners of the eountry had been entered, there was only filling in of stores in areas that aheady had 
them. This pattern is eonsistent with Wal-Mart's growth strategy, diseussed in the introduetion. 

This pattern of growth generates an exogenous souree of variation in the loeation and tirning of 
Wal-Mart store openings that provides natural instrumental vaiiables for Wal-Mart store openings. 28 In 
partieular, tirne and distanee from Benton County, Arkansas prediet where and when Wal-Mart stores will 
open. Flowever, this does not neeessarily imply that tirne and distanee ean serve as instrumental variables 
for exposure to Wal-Mart stores. If we posit an equation for Wal-Mart openings of the form 

(2) A WM t = K + ATP^&piST^^YR^r,,, 

i = 1 5=1 

then beeause equation (l’) already ineludes year and eounty fixed effeets, and the latter eapture all the 
variation in distanee, the time and distanee variables in equation (2) give us no identifying information. 

Flowever, equation (2) does not aeeurately eapture the geographie and time-related pattern of 
Wal-Mart openings. In partieular, the additivity of the distanee and year effeets in equation (2) implies 



28 

Jia [25] addresses the endogeneity of Wal-Mart store loeations in a more struetural fashion. In partieular, she 
studies the entry of ehain stores sueh as Wal-Mart and K-Mart stores into loeal markets and their effeets on the 
numbers of srnall retailers (rather than retail employment) as a two-stage game in which Wal-Mart and K-Mart first 
enter loeal markets, and in the seeond stage srnall retailers deeide whether to enter. Her estimates suggest that 
during the 1988-1997 sample period Wal-Mart’s expansion is responsible for half to three quarters of the deeline in 
the number of small retailers. In addition, she shows that estimating a redueed-form model and ignoring the 
endogeneity of entry deeisions eould underestimate the adverse effeets of Wal-Mart on small retailers by 50-60 
pereent. 
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that differenees aeross years in the probability of Wal-Mart openings are independent of distanee frorn 
Benton County. But as shown Figure 2, which depiets openings only, in the area near Arkansas openings 
are eoneentrated in the earliest years, whereas in the 1981-1985 period, for example, openings are more 
eoneentrated further away from Arkansas. This pattern beeomes more obvious in the 1986-1990 and 
1991-1995 maps, where openings thin eonsiderably in the area of Wal-Mart’s original growth, and are 
more eo mm on first in Florida, then in the Southeast and the lower Midwest, and finally in California, the 
upper Midwest, and the Northeast. 

The faet that the rate of openings slows eonsiderably in the Southeast, for example, in the later 
years, and inereases in areas further away — in a rough sense spreading out frorn Benton County like a 
wave (albeit imegular) — eontradiets the additivity of distanee and time effeets in equation (2), and instead 
implies that the model for Wal-Mart openings should have a distanee-time interaetion, with the 
probability of openings higher early in the sample period in loeations near Benton County, but higher 
later in the sample period further away frorn Benton County. Beeause this relationship holds through 
1995, when Wal-Mart had begun to saturate border areas, we restriet rnost of our analysis to this period, 
although we also report results using the full sample through 2002. The most flexible form of this 
interaetion, expanding on equation (2), ineludes interaetions between dumrny variables for year and for 
the different distanee ranges, as in 

(3) AWM Jt = K + nATPj, +£ J A i DIST l +f J v,YR, +f J f J 9 i ,(DIST,xYR,) + nil . 

i = 1 5=1 5=1 1=1 

Given this speeifieation, the endogenous effeet of Wal-Mart openings is identified in equation (l’) 
by using the distanee-time interaetions as instruments for exposure to Wal-Mart stores. 29 ' 30 Throughout, 



29 Soon after our researeh was eompleted, we diseovered reeent researeh done eoneurrently by Dube et al. [12], 
which also exploits the geographie pattern of Wal-Mart openings to identify their effeet on retail earnings growth. 
This paper foeuses only on earnings, and restriets the analysis to the 1992-2000 period. The diseussion of the maps 
in Figures 1 and 2, which suggest that Wal-Mart’s “fanning out” growth strategy had mn its eourse after 1995 or so, 
raises eoneerns about identifieation of the effeets of Wal-Mart stores using data only from 1992-2000. In addition, 
Dube et al. estimate the rnodels in levels — i.e., retail earnings growth on the number of stores (both on a per person 
basis) — instrumenting for the number of stores with distanee-time interaetions. However, as Figures 1 and 2 make 
elear, it is openings that are predieted by the distanee-time interaetion, not the total number of stores; for example, 
the number of stores remains high near Benton County late in the sample period. As a consequence, this model is 
misspeeified. If the distanee-time interaetions appear in the equation for openings (or the first differenee for the 
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we report standard errors that eluster on state and year, which are robust to heteroseedastieity of the error 
aeross state-year eells, and to spatial autoeoiTelations aeross eounties within states. 31 

A few eomments regarding identifieation of the model are in order. First, although there are, 
teehnieally speaking, more instruments than are needed to identify the model, overidentifying tests are in 
our context uninformative about the validity of the instruments. Conceptually, there is one instrument — 
the interaetion of distanee and time — and all we do here is to use a flexible form of this variable. Were 
there a good reason — based, for example, on loeation strategies gleaned from Sam Walton’s writings — to 
think that there were partieulariy exogenous loeation deeisions in a naiTower time period or at a speeifie 
distanee frorn Benton County, then arguably one rnight want to use this information to identify the rnodel, 
and test for the validity of distanee-time interaetions from other periods or further distanees using 
overidentifieation tests. Walton’s writings, however, bolstered by the maps in Figure 2, suggest that 
distanee and time aeted in a similar fashion throughout the entire period during which Wal-Mart stores 
spread to the borders of the United States. Thus, the entire set of instruments is either jointly valid, based 
on a priori arguments, or it is not. 32 

Seeond, in addition to the distanee-time interaetions predieting store openings, we also rnust be 
able to exclude distanee-time interaetions frorn the models for ehanges in retail employment and payrolls. 
Given that points at a given distanee from Benton County are loeated on a eirele with Benton County at 
its eenter, there is no obvious reason why this eondition would be expected to be violated. A partieular 



number of stores), the equation in levels would have to have these interaetions multiplied by a linear time trend. 
which would eapture the faet that the number of stores grew faster, for example, at distanees far from Benton 
County late in the sample period. An even more reeent paper is by Drewianka and Johnson [10], who eeho our 
eoneern about the IV identifieation strategy in the more reeent period they study (1990-2004). They instead simply 
eontrol for eounty-speeifie trends. However, the faet that the IV strategy we use probably makes little sense in part 
of the period they study does not negate the endogeneity problem, which they do not address. 

30 See a longer working paper version of this paper [35] for a simple theoretieal model that gives more rigorous 
justifieation for our empirieal speeifieations. 

31 Below, we also report estimates elustering on state only, which also allows temporal autoeorrelation. The 
standard errors are only slightly larger and the qualitative eonelusions are unehanged. One problem with this level 
of elustering is that given the large set of instrumental variables we have, the implieit number of degrees of freedom 
falls below the number of exclusion restrietions, so we eannot ealeulate an F-statistie for the exclusion restrietions in 
the first stage. 

32 More generally, suppose there is a eontinuous instrumental variable Z that is argued to be valid on a priori 
grounds. One eould not ereate a set of dummy variables eorresponding to eaeh value of Z and do an 
overidentifieation test to establish the validity of Z as an instrument. 
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area — say, 500 miles straight east frorn Benton County — may have eeonomie eonditions or strueture that 
differ frorn those in Benton County and henee also have systematieally different trends (or ehanges) in 
retail employment or earnings. But this does not irnply that we should expect similar influenees at the 
same time at points that are a eommon distanee from Benton County, aside from the effeets of Wal-Mart 
in these areas. Indeed this seems likely to be partieularly true for retail, which should exhibit little or no 
regional eoneentration, but instead be roughly the same aeross regions with similar levels of ineorne. In 
eontrast, this is less likely to hold for other industries, which — beeause they need not be loeated near their 
eustomers — ean be regionally eoneentrated; manufaeturing is the prime example, but the same is true of 
many other industries. Nonetheless, the model with fixed eounty effeets allows for different trends by 
eounty over the sample period, and thus should largely eapture differential ehanges in eeonomie 
eonditions aeross regions. Henee, even when looking at retail, we have more eonfidenee in the exclusion 
restrietions for the validity of the instruments when the eounty fixed effeets are ineluded. Moreover, for 
this speeifieation there is less reason to be skeptieal of using this framework to study employment 
ehanges in other industries, which ean serve as a falsifieation test; in partieular, we estimate the model for 
manufaeturing employment, which should not be affeeted, loeally, by the opening of Wal-Mart stores. 33 

Third, one might view the pattern of roll-out of Wal-Mart stores as indieating that store openings 
are exogenous and therefore it is not neeessary to eorreet for endogeneity. However, while the overall 
pattern and tirning appear to be exogenous, the exact eounties in which stores loeate are not. That is, 
Wal-Mart’s deeision to enter an approximate “ring” around Benton County, Arkansas, in a partieular 



33 On the other hand, we eannot eompletely rule out some eorrelation of retail aetivity with eounty and year speeifie 
shoeks. Of eourse the most flexible model is the fully saturated one that ineludes eounty-speeifie fixed effeets as 
well as their interaetions with the year fixed effeets in the equations we estimate, allowing for arbitrary differenees 
in the ehanges in the dependent variables by eounty and year. In this ease, of eourse, the distanee-time interaetions 
would provide no identifying information. But even in the absenee of attempts to aeeount for endogeneity, a model 
with unrestrieted time effeets for eaeh eounty would not permit the identifieation of the effeets of Wal-Mart stores. 
Thus, a more restrieted version of how distanee-time interaetions enter the employment and earnings equations 
would have to be imposed whether or not we were addressing the endogeneity of Wal-Mart openings. 
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period may be exogenous, but where in the ring they enter is not, and it is this endogeneity with which we 
are eoneerned, and for which our instrument eorreets. ' 4 
V. Empirieal Results 

Deseriptive Statisties 

Deseriptive statisties for population, employment, and payroll are reported in Table 1. Most of 
the statisties are for the A sample, with the exception of those for general merehandising. Statisties for 
the B sample were similar, although eounties in the B sample are larger sinee data are less likely to be 
suppressed for larger eounties and eounties with larger retail seetors. On average, eounties have 
approximately 78,000 residents. The data eover 3,032 eounties, 98 pereent of the total. The average level 
of aggregate retail employment is 5,659, or about 5.5 pereent of total employment. Employment in 
general merehandising is about one-sixth of the retail total, or 966 workers on average. Average 
aggregate retail payrolls are around $92 million, and general merehandising payrolls are about $14 
million. Although not reported in the table, average retail payrolls per worker aeross eounties is around 
$13,700, and the figure for general merehandising is ai'ound $13,100. 

Table 2 provides some deseriptive statisties on Wal-Mart stores, as of 1995. As indieated in the 
first row, for eounties with one or rnore stores open the average number of stores is 1.43. The seeond row 
indieates that just over half of eounties have at least one Wal-Mart store. The remaining rows of the table 
report the distiibution of nurnber of stores per eounty (for eounties with one or more stores). Around 78 
pereent of the eounties with Wal-Mart stores have only one store, about 13 pereent have two stores, and 
around 4 pereent have three stores. There is then a smattering of observations with more stores (with a 
maximum of 17 stores at the end of the sample period in Harris County, Texas, which ineludes Houston). 
Preliminary Evidenee on Endogeneity Bias 

It is useful to ask whether we ean deteet evidenee of endogeneity, and infer something about the 
direetion of endogeneity bias, without relying explicitly on the identifying assumptions underlying the IV 

34 Khanna and Tiee ([30], [31]) have also noted the geographie pattern of the spread of Wal-Mart stores that we 
exploit. Based on this pattern, they study the effeet of Wal-Mart entry on ineumbent firms, treating Wal-Mart entry 
as exogenous. They use somewhat larger areas than we do (based on the first three digits of zip eodes), but even so, 
deeisions as to which of these to enter may be endogenous. 
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estimation. One way to get some indireet evidenee on endogeneity is to look at the relationship between 
past growth in retail employment and deeisions to open Wal-Mart stores. As Figures 1 and 2 indieate, the 
eorreet eomparison for any period is not between all eounties where stores did and did not open. Rather, 
for a partieular period we should identify the region in which many stores were opening, and ask 
whether — within that region — stores were opening in fast-growing eounties. 35 

We did this eomparison for different seleeted periods and regions to give a sense of what this 
ealeulation yields throughout the sample. We first foeused on store openings in the early 1980s, in states 
elose to Arkansas, restrieting attention to openings in the 1983-1985 period. We eomputed openings per 
state, and ehose the ten states with the highest number of openings. These ten states are shaded in the first 
map in Figure 3, which elearly shows that these are states in relatively elose proximity to Arkansas (as 
well as Florida). The map also shows the same openings, in the 1981-1985 period, which were displayed 
in Figure 2. The overlaying of these openings and the shaded states makes elear that these were the states 
in which openings were eoneentrated in this period. For eaeh eounty in these ten states, we eomputed the 
annualized rate of growth of retail employment over the immediately preeeding five-year period 1977- 
1982. 36 We then estimated iinear probability models for whether Wal-Mart entered a eounty (opening its 
first store, which, as Table 2 shows, would be the ease for most openings), as a funetion of the prior 
growth rate of retail employment. We add eontrols for eounty population as well as state dummy 
variables, to aeeount for the effeets of population density as well as other unspeeified features of states on 
store openings. We then did the same for the 1988-1990 period, using retail employment growth frorn 
1982-1987, and the 1993-1995 period, using retail employment growth frorn 1987-1992. The 
eorresponding maps are shown in the seeond and third panels of Figure 3. 

The results, reported in Table 3, indieate that Wal-Mart stores entered eounties — among an 
appropriate eomparison group — that had previously had faster retail employment growth. For the 1993- 
1995 estimation, the set of ten states with the most openings ineludes California and Washington, which 

35 A reeent study eommissioned by Wal-Mart (Global Insight, 15) instead does the first eomparison, and erroneously 
eoneludes that there is no evidenee of endogeneity bias. 

36 This window is arbitrary, but follows the ealeulation in the Global Insight study [15]. 
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are quite far geographieally from the other eight states and therefore the eomparison to eounties in those 
other states may not be appropriate; we therefore also show results excluding these two states. As the 
table shows, in every ease the relationship between prior growth and whether a store opened is positive 
(and sometimes signifieant, although that is not of foremost interest here). This evidenee does not 
direetly address the issue of endogeneity bias — which eoneerns the link between store openings and 
future retail employment absent store openings. But it does suggest that endogeneity is a eoneern. 
First-Stage Estimates 

Turning to our rnain analysis, Figure 4 displays information on the estimated eoeffieients of the 
distanee-time interaetions from the first stage, although for ease of inteipretability we present estimates 
for the ehange in the nurnber of stores rather than the nurnber of stores per person. The omitted eategories 
are 900-1000 miles and 1988. The first-stage regression ineludes 272 distanee-time interaetions (between 
17 year dummy variables and 16 distanee dummy variables). In Figure 4, we break these up into the 16 
distanee groups, for eaeh distanee range around Benton County. 

Starting with distanees near Benton County, Arkansas, we see that store openings were more 
likely eaidy in the sample period, and less likely later on. Conversely, at distanees far from Benton 
County store openings were rnueh more eommon later in the sample period. Alternatively, store openings 
were rnueh rnore likely at nearby distanees eariy in the sample, and at farther distanees later in the sarnple. 
Thus, these regressions eonfirm the impression frorn the rnaps in Figure 2 that store openings spread out 
over the sample period, eoneentrated near Benton County early in the sample period and farther later. 
Effeets on Retail Seetor Employment 

We now turn to the OLS and IV estimates. The OLS estimates identify the effeet of Wal-Mart 
simply from differenees in outeomes in eounties and years where stores opened eompared to those where 
they did not. In eontrast, the IV estimates identify the effeet frorn differenees in outeomes in eounties and 
yeai's with a high versus a low predieted probability of store openings, as depieted in Figure 4. Thus, for 
example, a negative effeet of Wal-Mart openings on retail employment will be inferred when retail 
employment fell (or grew more slowly) in eounty-year pairs that were in geographie regions (defined by 
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distanee from Benton County, Arkansas) and years in which store openings were rnore likely, without 
referenee to the aetual eounties within these regions in which stores opened. 

Table 4 reports estimates for retail employment. The dependent variable is the ehange in retail 
employment per person at the eounty level, and the Wal-Mart measure is the ehange in the nurnber of 
stores per person. We report estimates both excluding and ineluding eounty-speeifie tirne trends 
(equations (1) and (l’)); the latter is our preferred speeifieation. The estimates for retail employment (and 
earningsj turn out to be insensitive to the inelusion of the eounty fixed effeets, which bolsters the validity 
of the assumption that the instrumental variables are not eorrelated with the error terms. 

The OLS estimates point to inereases of 40-55 workers in both aggregate retail and general 
merehandising employment from a Wal-Mart store opening. 37 For the A sample, the estimates irnply an 
inerease in eounty-level aggregate retail employment of about 0.8 pereent at the mean. The estimate for 
general merehandising is slightly greater than the estimate for aggregate retail, for the B sample for which 
these estimates aie eomparable, suggesting that there is at most a small reduetion in employment in the 
rest of the retail seetor; however, the differenee is not signifieant. Finally, as we would expect, the 
estimated eoeffieient of total payrolls per person is positive. 

In eontrast to the OLS estimates, the IV estimates — which eonditional on the identifieation 
strategy being valid are interpretable as eausal effeets of Wal-Mart openings on retail employment — point 
to employment deelines in the aggregate retail seetor. 38 Without eounty-speeifie trends, the estimates for 
the A sample indieate that a Wal-Mart store opening reduees employment at the eounty level by about 
167 workers. With eounty-speeifie trends the estimate falls to 146. In the B sample the estimates are 
smaller by about 30. Sinee the average nurnber of workers in a Wal-Mart store is about 360 [4], the 
estimated employment deeline (using a figure of 150, elose to the equation (l’) estimate) implies that 
eaeh Wal-Mart worker takes the plaee of 1.4 retail workers. On a eounty basis, this estimate implies a 2.7 

37 The retail estimates are similar to the long-run OLS estimates reported by Basker [4]. 

38 The F-statisties for the joint signifieanee in the first-stage regression of the distanee-time interaetions that serve as 
instruments range from 3.9 to 12.8 aeross the different samples and speeifieations in Table 4. These F-statisties are 
sometimes small enough that asymptotie results may not apply well [43]. However, in sueh eases bias in the IV 
estimates is in the direetion of the OLS bias [8], and therefore sueh bias eannot aeeount for the differenees between 
our IV and OLS results. 
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pereent reduetion in retail employment attributable to a Wal-Mart store opening. This eould refleet a 
eombination of elosings of other retail establishments and employment reduetions at those establishments. 
And of eourse these estimates do not irnply absolute deelines in retail employment, but only that retail 
employment was lower than it would have been had Wal-Mart stores not opened. 

The estimated magnitude strikes us as plausible. Government data indieate that sales per 
employee are mueh higher at large retailers than at small retailers, 39 and a MeKinsey study [26] suggests 
mueh higher sales per employee at Wal-Mart than at other large retailers (in 1995, 70 pereent higher than 
Sears, and 36 pereent higher than K-Mart). In addition, a negative employment effeet would be expected 
if Wal-Mart is indeed rnore effieient and ean be profitable while eharging signifieantly lower priees. 40 
Finally, Wal-Mart is eurrently about 1 pereent of the workforce, but it was smaller during most of sample 
period, so this displaeement figure does not imply large-seale employment deelines; as noted above, it 
implies that a store opening results in a 2.7 pereent deeline in retail employment. 

For general merehandising, the IV estimates eontinue to point to inereases in employment 
(sometimes signifieant), qualitatively similar to the OLS estimates although smaller. We rnight expect 
inereases in general merehandising employment sinee this is the seetor in which Wal-Mart is elassified. 
Flowever, the evidenee that the estimated inerease in general merehandising employment is mueh less 
than the size of an average Wal-Mart store suggests that Wal-Mart reduees employment rnainly at other 
employers in the general merehandising seetor, rather than in the remainder of the retail seetor. 41 Thus, 
the estimates are eonsistent with Wal-Mart redueing overall retail employment, although shifting its 



39 See, for example, data from the 2002 Eeonomie Census, at 

http://factfinder.census.gov/servlet/IBQTable?_bm=y&-geo_id=&-ds_name=EC0244SSSZl&-_lang=en (as of 
September 21, 2006). 

40 There is a potential eountervailing positive effeet from Wal-Mart’s greater effieieney if lower priees in retail boost 
eonsumer demand enough through either ineome or substitution effeets that overall retail seetor employment 
inereases. However, this “retail seale” effeet may be srnall if the types of produets sold at Wal-Marts and the retail 
establishments with which they eompete are more likely to be inferior goods, so that mueh of the inerease in real 
ineome stemming frorn lower priees is spent in other seetors of the eeonomy (ineluding higher-end retail). 

41 There is a potentially important implieation of this finding for interpreting our estimates. In partieular, the 
estimated effeets of Wal-Mart openings eould be exaggerated if Wal-Mart store openings tend to spur the opening of 
other large retailers in the same area, in which ease we are effeetively estimating the impaet of more than just Wal- 
Mart stores. (It is unlikely that the issue is one of Wal-Mart opening where other large retailers already exist, given 
the geographie pattern of Wal-Mart openings. Also, even if this sometimes happens, our IV estimates will not piek 
up the effeets of these other stores beeause their openings will not be predieted by the first stage.) But given that 
general merehandising employment seareely rises when Wal-Mart stores open, this seems unlikely. 
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eomposition toward general merehandising; reeall, though, that the evidenee for general merehandising 
(and the B sample overall) may be tainted by endogenous sample seleetion beeause of suppressed data. 

The finding that the OLS estimates of employment effeets for the aggregate retail seetor are 
generally positive, and the IV estimates negative, is eonsistent with Wal-Mart endogenously loeating 
stores in plaees where retail growth is inereasing. As shown in the table, there is always statistieally 
signifieant evidenee of endogeneity bias in the aggregate retail seetor. We also rejeet exogeneity in the 
estimates for general merehandising, and the bias still appears to be upward, which is what we would 
expect if Wal-Mart enters markets where prospeets for general merehandising are good. However, we 
should not expect rnueh endogeneity bias in the estimated effeet of Wal-Mart on employment in this 
subseetor, and that is refleeted in the mueh smaller differenee between the OLS and IV estimates of the 
effeets of Wal-Mart on general merehandising employment. Wal-Mart may well have ehosen to loeate in 
areas where retail growth was likely to be strong, but the eomposition of the retail seetor is more likely to 
be a direet consequence of where Wal-Mart ehose to loeate than a determinant of its loeation deeisions. 
Effeets on Retail Seetor Earnings 

Table 5 turns to effeets on retail seetor eai'nings. The dependent variable is now the ehange in 
retail payrolls per person. The first three eolumns report results without eounty-speeifie trends, and the 
last three eolumns ineluding these trends. The results are similar in both eases. 

The OLS estimates indieate that a Wal-Mart store opening is assoeiated with inereases in retail 
payrolls of about $0.2 to $0.3 million in aggregate retail and about $0.44 million in the general 
merehandising seetor. In eontrast, the IV estimates indieate that aggregate retail payrolls fall when a Wal- 
Mart store opens, by approximately $1.1 to $1.7 million for the A sample; the range is slightly smaller for 
the B sample. In addition, the evidenee is weaker statistieally for the B sample and when eounty-speeifie 
trends are ineluded (not signifieant for the A sample). However, the positive effeet of Wal-Mart openings 
on payrolls in general merehandising remains, and the estimates are a bit larger than the OLS estimates. 

The payroll results largely parallel those for employment. Of eourse, rnueh of the publie debate 
over Wal-Mart eoneerns the levels of wages Wal-Mart pays and whether it drives down wages of 
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eompetitors. As pointed out earlier, beeause the CBP data do not distinguish part-tirne frorn full-time 
workers, the data eannot be used to eompute hourly wages. In addition, the data do not allow any eontrols 
for individual worker eharaeteristies that would be required to determine whether wages were ehanging 
for eomparable workers. Nonetheless, we estimated models for ehanges in retail earnings per worker, 
with the Wal-Mart variable in this ease simply measuring the ehange in the nurnber of stores. The point 
estimates were positive for the A sample (although small — indieating inereases of $7 to $47 — and 
insignifieant); they were negative and insignifieant for the B sample. Thus, there is eertainly no evidenee 
that Wal-Mart openings reduee retail earnings per worker. Taken literally, the implieation of no irnpaet of 
store openings on wages would irnply that the labor supply eurve to the retail industry is elose to perfeetly 
elastie. A highly elastie labor supply eurve may not be unreasonable, given that retail is a small share of 
the overall labor market, and is a very low-skill seetor in which labor supply ean respond very quickly 
and strongly to wage ehanges (in eontrast to a highly-skilled seetor). However, we remind the reader that 
the CBP data eould be quite misleading about wage effeets. For example, if Wal-Mart induees a shift 
toward older or full-time workers, inereased retail earnings per worker eould be eonsistent with a lower 
wage rates for eomparable workers. 42 Regardless, the eombined estimates indieate that overall earnings 
in the retail seetor deeline after Wal-Mart stores open. 
identifieation 

A natural prineipal eoneern with our approaeh is identifieation. We therefore address a few 
issues related to identifieation in Table 6, foeusing on the aggregate retail employment estimates for the 
full (A) sample. First, if we are aetually deteeting effeets of Wal-Mart stores, then paralleling the 
evidenee of adverse effeets on retail, we should fail to find evidenee of store openings on industries 
unlikely to be affeeted. Employment in serviee industries and retail seetors other than depai tment stores 
do not neeessarily provide useful falsifieation exercises of this type, sinee the loeation of retail aetivity 

42 In prineiple, one eould study these issues using data frorn the Current Population Survey (CPS) or the Deeennial 
Census of Population. However, in the CPS most eounty identifiers are suppressed for reasons of eonfidentiality. 
Census data are less atttaetive beeause they are not available for eaeh year but only onee a deeade. Furthermore, in 
downloadable Census estimates by eounty frorn all long-form respondents, neither hours, edueation, nor ineome are 
available by industry. 
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may affeet these other industries and seetors. However, it is useful to look at manufaeturing. Although 
this industry rnight be affeeted by eost pressures exerted by Wal-Mart, there is no reason why the effeets 
of Wal-Mart openings should be felt on loeal manufaeturers. As noted earlier, we are skeptieal of using 
our instrumental variables approaeh to study an industry like manufaeturing when the eounty fixed effeets 
are excluded, sinee the other industry may have strong region-speeifie trends; but we are more eonfident 
with the eounty fixed effeets ineluded. The results for manufaeturing are reported in row 2 of Table 6. 

We find that in the speeifieations with the eounty dummies, there is no effeet of Wal-Mart openings. On 
the other hand, the speeifieations without the eounty dummies show a positive effeet. The faet that we 
only get an effeet when we exclude the eounty dummies (which means we do not allow for eounty- 
speeifie trends) helps eonfirm our eoneerns about the validity of the insti ument for manufaeturing, as the 
differenees in results irnply that the instrument(s) is (or are) eorrelated with these trends. But the results 
with the eounty dummies elearly provide a better falsifieation test, and show no effeet of Wal-Mart 
preeisely where we should see none. 

A seeond eoneern is that our distanee-time interaetion instrumental variables do not simply 
eapture the beginning of the period when Wal-Mart stores first “reaeh” a region, but also periods in which 
they were filling in regions where numerous stores had already opened. For example, Figure 4 shows that 
in the band 200-300 miles from Benton County, there are higher predieted openings through 1982, when 
stores were first opening in this region, as well as in 1987, when stores were filling in this region. This is 
not neeessarily problematie for the IV strategy. The distanee and tirne interaetions prediet where and 
when stores were opened, whereas the endogeneity we are trying to purge eoneerns the ehoiee about the 
partieular eounties in which to open stores given that Wal-Mart was opening stores in a partieular region. 
So the first-stage estimates will also refleet stores opening in the 200-300 rnile band in 1987. But all 
eounties in this band in this year have a higher predieted probability of a store opening, not just the 
speeifie eounties where stores opened. Nonetheless, it rnight be preferable to foeus on the variation in 
store openings assoeiated with Wal-Mart’s initial wave of entry into a region, which may be more 
exogenous than where the eompany ehose to fill in stores. To do this, we restrieted the distanee-time 
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interaetions ineluded in the first stage to those pieking up either observations early in the sample period 
elose to Benton County (1982 or earlier and within 500 miles of Benton County), and observations late in 
the sample period far frorn Benton County (1992 or later and 1,000 rniles or nrore from Benton County). 
Figure 4 suggests that these periods and distanees eapture distinet initial waves of Wal-Mart openings. 

The estimates, reported in row 3 of Table 6, are relatively similar to the baseline estimates, although the 
speeifieation with eounty durnrny variables ineluded provides weaker evidenee (signifieant only at the 10- 
pereent level) of negative employment effeets. 43 

Finally, our identifieation strategy requires that there not be unmeasured shoeks to the dependent 
variable that are eomrnon to points on a ring of given radius around Benton County. We argued that this 
eondition seems likely to hold, sinee there is no obvious reason why all points within, say 400-500 miles 
should exhibit systematieally different ehanges in partieular periods relative to Benton County (aside 
frorn the effeets of Wal-Mart). Flowever, for short distanees around Benton County it is possible that 
there are eommon shoeks, and if it is the differenees in retail employment growth between these short 
distanees and greater distanees that identifies the effeets of Wal-Mart in our empirieal analysis, then we 
eould be pieking up a spurious relationship. In rows 4-6 of Table 6. we present evidenee assessing this 
possibility, dropping all observations on eounties within, in turn, 100, 200, and 300 miles of Benton 
County. As the table shows, the results are robust to dropping these eounties. 

Robustness and Other Analyses 

We elose the empirieal analysis by briefly deseribing some additional analyses exploring the 
robustness of our results on retail employment. and some other issues, in Table 7. As noted earlier, we 
have reported standard eiTors elustered on state and year. In row 2, we instead report estimates (which are 
the same) with standard eiTors elustered on state only, which allows temporal autoeorrelation within and 
aeross eounties in eaeh state. The standard errors are larger in all eases, although not by that mueh, and 
the statistieal eonelusions are unehanged. Note also that in eolumn (2) the standard eiTor ehanges by less. 



43 This speeifieation uses 58 instead of 272 instruments. 
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This is what we would expect if the inelusion of the eounty-speeifie trends eaptures sourees of 
dependenee between observations on the same eounty. 

We have ineluded a eontrol for total payrolls per person in all of the speeifieations reported thus 
far. Beeause this variable ineludes retail payroll, it is potentially endogenous. In rows 3 and 4 we 
therefore report two additional speeifieations: one with retail payrolls subtraeted out of total payrolls; and 
the other simply dropping the total payrolls eontrol. The qualitative eonelusions are very similar, 
although both ehanges — and rnore so dropping the total payroll eontrol — yield slightly laiger negative 
effeets on retail employment. Next, we drop eounties that never had a Wal-Mart store during the sarnple 
period. In this ease, identifieation of the effeets of Wal-Mart stores eomes only frorn the time-series 
variation in store openings for the set of eounties that got a store, providing a potentially “eleaner” eontrol 
group. The results, reported in row 5, are qualitatively similar, although the estimated negative 
employment effeets are smaller. 

We also eonsider some other differenees in the sample definition. Row 6 reports estimates 
dropping the small number of eounties with stores that elosed; these estimates are very elose to the 
baseline estimates. 44 Following that, row 7 reports estimates extending through the entire period eovered 
by the data — ending in 2002 rather than 1995 — even though, as diseussed earlier, the identifying 
relationship between tirne, loeation, and store openings is strongest through 1995. For the longer period 
the evidenee suggests sharper deelines in retail employment. Thus far we have ignored information on 
Sam’s Clubs. We do not neeessarily want to treat these as equivalent to other Wal-Mart stores, so we 
study thern in two ways. First, we ornit all eounties that had a Sam’s Club at some point during the 
sample period; 97 pereent of these eounties also had a Wal-Mart. Seeond, we use the full sample but treat 
eaeh Sam’s Club store like other Wal-Mart stores. As the table shows in rows 8 and 9, the results 
dropping the eounties with Sam’s Clubs are qualitatively similar, although the estimated effeets are 
smaller. The results are quite insensitive to simply treating Sarn’s Clubs like other Wal-Mart stores. 

44 An altemative is to ineorporate information on store elosings as reduetions in the number of Wal-Mart stores. 
However, we are skeptieal that store elosings and openings have symmetrie (opposite-sign) effeets, and we do not 
have a natural instrument for store elosings. 
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To this point we have simply used a eount of Wal-Mart stores (on a per eapita basis) to measure 
exposure to Wal-Mart. But store size varies — for example, eounties with smaller populations may get 
smaller stores 45 — and the exposure measure should perhaps take this into aeeount. Thus, we eomputed an 
exposure measure that weights by store size relative to the average store size. These estimates are 
reported in row 10, and reveal somewhat larger effeets. Note, however, that in our data size is measured 
as of the end of the sample period only, and will not piek up things sueh as the eonversion of a regular 
store to a Supereenter. 

The effeets of Wal-Mart stores may depend on the type of area in which they are loeated. In rural 
and less dense eounties Wal-Mart may have a larger impaet beeause there are fewer other large-seale 
eompetitors before they enter. Alternatively, in these areas the stores may be more likely to ereate jobs 
beeause shoppers (and workers) eome frorn other eounties. As shown in Figure 5, Wal-Mart stores 
originally opened more eommonly in rural areas, and over time shifted to more urban areas. (This 
elassifieation is based on whether the store opened in what is now elassified as an MSA.) Beeause we do 
not have a elean elassifieation of eounties loeated in MSA’s or not, we instead foeus on eounty population, 
and estimate the models for two samples: those with above median population at the beginning of the 
sample period, and those with below median population. As shown in rows 1 1 and 12, there is no elear 
eonneetion between density and the effeets of Wal-Mart stores. 

Finally, we use equations differeneed over two and then three years, rather than one year. We do 
this for a few reasons. First, there may be sorne antieipatory effeets of Wal-Mart store openings. This 
eould lead to effeets of store openings that are biased towai'ds zero, if the shorter differenees do not 
inelude these antieipatory effeets (which seern likely to be in the same direetion as those oeeurring after 
stores open). Seeond, the effeets of store openings rnight not all oeeur within a year. Allowing for longer 
differenees ean piek up longer-term effeets, and it is ambiguous whether these are likely to be larger or 
smaller. 46 Third, beeause of how we eode store openings, in about one-sixth of eases (for stores opening 

45 This may in part refleet whether or not the Wal-Mart store is a Supereenter. 

4h In prineiple, one ean study eontemporaneous and lagged effeets more direetly by ineluding lagged store openings 
as well. However, experimenting with sueh speeifieations in the IV setting led to uninformative estimates, beeause 
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between January and early Marehj the employment ehange and store opening may be misaligned, likely 
biasing the estimated effeet towards zero. By using longer differenees, we reduee this problem 
substantially. As the estimates in the last two rows of Table 7 show, with two- and three-year differenees 
the estimates are qualitatively very similar (and were for the other samples, for general merehandising, 
and for OLS as well as IV), but always slightly larger in absolute value the longer the differenees, as we 
rnight have expected from the first and third eonsiderations diseussed above. 47 
VI. Conclusions and Diseussion 

Motivated in large part by loeal poliey debates over Wal-Mart store openings, and the large size 
of Wal-Mart relative to the retail seetor, we estimate the effeets of Wal-Mart stores on retail employment 
and earnings. Critics have eharged that Wal-Mart’s entry into loeal labor markets reduees employment 
and wages, and the eompany (and others) have eountered that these elaims are false, and touted Wal- 
Mart’s retail job ereation effeets. 

Our analysis emphasizes the importanee of aeeounting for the endogeneity of the loeation and 
tirning of Wal-Mart openings that — in our view, and as borne out by the data — is rnost likely to bias the 
evidenee against finding adverse effeets of Wal-Mart stores. Our strategy for addressing the endogeneity 
problem is based on a natural instrumental variables approaeh that arises beeause of the geographie and 
tirne pattern of the opening of Wal-Mart stores, which slowly spread out in a wave-like fashion frorn the 
first stores in Arkansas. 

The findings in this paper rather strongly belie elaims, as well as reeent researeh findings, 
suggesting that Wal-Mart store openings lead to inereased retail employment. On average, Wal-Mart 
store openings reduee retail employment by about 2.7 pereent, implying that eaeh Wal-Mart employee 
replaees about 1.4 employees in the rest of the retail seetor. Driven in part by the employment deelines, 
retail earnings at the eounty level also deeline as a result of Wal-Mart entry, by about 1.5 pereent. It is 

predieted values of store openings at different lag lengths were very highly eorrelated, which is not surprising given 
the nature of our IV. 

47 Finally, although not reported in the table, we also estimated a simpler speeifieation for the ehange in retail 
employment per person on the ehange in the number of Wal-Mart stores ( with the latter not on a per eapita basis, 
unlike the rest of the estimates reported in the paper). Both the OLS and IV estimates were qualitatively similar to 
those reported in Table 4, and implied similar pereentage ehanges in retail employment. 
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harder to draw any firm eonelusions regarding the effeets of Wal-Mart on wages, although the data do not 
provide any indieation that retail earnings per worker are affeeted by Wal-Mart openings. 

Note that the estimated adverse effeets on retail employment do not imply that the growth of Wal- 
Mart has resulted in lower absolute levels of retail employment. Like in all studies of this type, the 
estimates are relative to a eounterfaetual of what would have happened to retail employment absent the 
effeets of Wal-Mart. Frorn 1961, the year before the first Wal-Mart store opened, through 2004, the last 
full year for which we have a eount of the Wal-Mart Diseount Stores and Supereenters, retail employment 
in the United States grew frorn 5.56 million to 15.06 million, or 271 pereent, eonsiderably faster than 
overall employment (242 pereent). 48 If eaeh of the 3,066 stores present in January of 2005 redueed retail 
employment by our estimate of 146 workers (Table 4, eolumn (4)) relative to the eounterfaetual, then our 
estimates irnply that, in the absenee of Wal-Mart, retail employment would have instead grown to 15.51 
million as of 2004, or 3 pereent higher than the observed figure. So the negative employment effeets of 
Wal-Mart that we estimate simply imply that retail employment growth was a bit rnore modest than it 
would otherwise have been, growing by 271 pereent from 1961 through 2004, rather than 279 pereent. 

The estimates do imply, however, that retail employment is lower than it would have been in eounties that 
Wal-Mart entered, and henee that Wal-Mart has negative rather than positive effeets on net job ereation in 
the retail seetor. 

The lower retail employment assoeiated with Wal-Mart does not neeessarily irnply that Wal-Mart 

stores worsen the eeonomie fortunes of residents of the markets that these stores enter. Our results apply 

only to the retail seetor, and we suspeet that there are not aggregate employment effeets, at least in the 

longer run, as labor shifts to other uses. Wage effeets are more plausible, although these may operate 

more on the manufaeturing side through the buying power that Wal-Mart exerts, as opposed to the retail 

side which is a low-wage seetor regardless of Wal-Mart — although there are exceptions sueh as relatively 

highly-paid groeery workers who may be harmed frorn eompetition with Supereenters. If there are wage 

(or employment) effeets that arise through eost pressures on Wal-Mart’s suppliers, however, they would 

48 See http://data.bls. gov/PDQ/servlet/SurveyOutputServlet?&series_id=CEU4200000001 and 
ftp://ftp.bls.gov/pub/suppl/empsit.ceseebl.txt (as of Deeember 22, 2006). 
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not neeessarily be eoneentrated in the eounties in which stores open, so that our methods would not 
identify thern. 

Moreover, Wal-Mart entry may also result in lower priees that inerease purehasing power, and if 
priees are lowered not just at Wal-Mart but elsewhere as well, the gains to eonsumers may be widespread. 
Furthermore, the gains may be larger for lower-income families [19], although it is also possible that 
labor market consequences for these families are also more adverse. 

Another line of eritieism of Wal-Mart is that through lowering wages it inereases the burden on 
taxpayers by inereasing eligibility for and reeeipt of government benefits. 49 Flowever, a key implieit 
assumption is that in the absenee of Wal-Mart, employees of the eompany would have higher-paying jobs, 
rather than, for example, no jobs. Thus, the validity of this eritieism hinges on whether Wal-Mart’s entry 
into a labor market affeets overall employment and wages. It is also worth pointing out that if Wal-Mart 
eauses both eai'nings and priee deelines for low-income families, then taxpayer burden eould inerease 
even if the priee deelines rnore than offset the earnings deelines for these families, beeause government 
programs are typieally tied to earnings. 

Thus, aside frorn the question of employment effeets, there are numerous remaining questions of 
eonsiderable interest regarding the effeets of Wal-Mart on labor markets and goods markets, on 
eonsumption, and on soeial program partieipation and expense. In addition, of eourse, there are non- 
eeonomie issues that play a role in the debate over Wal-Mart, sueh as preferenees for downtown shopping 
distriets versus suburban malls. The identifieation strategy developed in this paper may prove helpful in 
estimating the effeets of Wal-Mart stores on some of these other outeomes as well. 



49 A report by the Demoeratie Staff of the Committee on Edueation and the Workforce of the U.S. Congress [33] 
elaims thatbeeause of Wal-Mart’s low wages, an average Wal-Mart employee eosts federal taxpayers an extra 
$2,103 in the form of tax eredits or deduetions, or publie assistanee sueh as healtheare, housing, and energy 
assistanee (see also [11]). There are many heroie assumptions needed to eonstruet sueh estimates, and this is not the 
plaee to disseet them. 
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Figure 1: Loeation of Wal-Mart Stores, 1965-2005 
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Figure 2: Loeation of Wal-Mart Openings, 1962-2005 
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ineludes all openings as of January 31, 2005. 




Figure 3: Geographie Concentration of Wal-Mart Openings by Period and Geographie Area 
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Figure 4: First-Stage Results for Change in Number of Stores 
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Panels show estimated eoeffieients of distanee-year interaetions frorn estimates of equation (3), although 
estimated for the ehange in the nurnber of stores rather than the ehanges in the nurnber of stores per 
person. The omitted referenee eategories are 1988 and 900-1000 rniles. 
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Figure 5: Loeations of Wal-Mart Openings 
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Classification is based on loeation of store in MSA. 





Table 1: County-Level Summary Statisties, Population, Employment, and Payroll, 



1977-1995 





Means 

(1) 


Medians 

(2) 


Population 


78,223 

(256,595) 


22,644 


Number of eounties/share of total 


3,032 

0.98 




Aggregate retail employment (N=57,964) 


5,659 

(19,758) 


1,105 


Retail payrolls ($l,000’s) (N=57,964) 


92,208 

(364,022) 


14,990 


General merehandising employment 
(B sample, N=37,999) 


966 

(2,691) 


195 


General merehandising payrolls ($l,000’s) 
(B sample, N=37,999) 


14,170 

(43,099) 


2,477 



Figures are for full (A) sample except for general merehandising. In eolumn (1) 
standard deviations are reported in parentheses. Payrolls are in thousands of 1999 
eonstant dollars. 



Table 2: County-Level Summary Statisties, Wal-Mart Stores, 


1995 


Average nurnber of stores, for eounties with stores 


1.43 


Share with one or rnore stores open, all eounties 


0.505 


Share with given nurnber of stores, for eounties with stores 




1 store 


0.783 


2 stores 


0.128 


3 stores 


0.042 


4 stores 


0.017 


5 stores 


0.013 


6 stores 


0.006 


7 stores 


0.004 


8 stores 


0.003 


9 stores 


0.002 


10 stores 


0.001 


1 1 stores 


0.001 


12 stores 


<0.001 


13 stores 


<0.001 


14 stores 


<0.001 


15 stores 


<0.001 


16 stores 


<0.001 


17 stores 


<0.001 


Nurnber of eounties 


3,064 



Numbers are for full (A) sample of observations with aggregate retail 
employment data in 1995. 






Table 3: Estimated Relationships Between Store Openings and Prior Retail 
Employment Growth 



Openings in: 


1983- 

1985 


1988- 

1990 


1993- 

1995 


1993-1995, 
Excluding CA 
and WA 




(1) 


(2) 


(3) 


(4) 


Annualized Rate of 
Growth of Retail 
Employment in 
Previous Five-Year 
Period 


0.439*** 

(0.143) 


0.191 

(0.177) 


0.122 

(0.515) 


0.407 

(0.613) 


N 


854 


615 


363 


299 



Estimates are frorn linear probability models, with standard eiTors robust to 
heteroseedastieity. The states eovered in eaeh sample period are shown in Figure 
3. The previous five-year periods eoiTesponding to the three eolumns are: 1977- 
1982, 1982-1987, and 1987-1992. County population in the ending year of these 
five-year periods, and state dummy variables, are ineluded as eontrols. Growth 
rates are eomputed relative to the average at the beginning and end of eaeh 
period, beeause there are a handful of observations with retail employment of 
zero. Growth rates are measured on an annualized basis. Thus, for example, the 
estimate in eolumn (1) implies that a 1 pereent higher annual growth rate boosts 
the probability of a store opening by 0.0044; this is a 2.7 pereent inerease in the 
probability of a store opening. 






Table 4: Estimated Effeets of Wal-Mart Stores on Retail Employment 





County-Specific Time Trends Excluded 
(Equation (1)) 


County-Specific Time Trends Ineluded 
(Equation (l’)) 




Agg. Retail 


Agg. Retail 


Gen. Mereh. 


Agg. Retail 


Agg. Retail 


Gen. Mereh. 




( 1 ) 

A Sample 


( 2 ) 

B Sample 


(3) 

B Sample 


(4) 

A Sample 


(5) 

B Sample 


( 6 ) 

B Sample 


OLS 














Wal-Mart stores 


44.52*** 

(6.29) 


42.33*** 

(9.11) 


54 . 74 *** 

(3.94) 


40.01*** 

( 6 . 02 ) 


42 75*** 
(9.61) 


53.81*** 

(3.99) 


Total payrolls per person 


0 . 001 *** 

( 0 . 0002 ) 


0 . 001 *** 

(0.0003) 


0 . 0001 *** 

(0.00003) 


0 . 001 *** 

(0.0003) 


0 . 001 *** 

(0.0003) 


0 . 0001 *** 

(0.00003) 


Adjusted R 2 


0.084 


0.109 


0.081 


0.068 


0.089 


0.038 


IV for Wal-Mart stores 














Wal-Mart stores 


-166.98*** 

(48.83) 


-134.48** 

(54.26) 


29.36** 

(11.80) 


-146.35*** 

(47.77) 


-105.80* 

(56.69) 


17.92 

(14.20) 


Total payrolls per person 


0 . 001 *** 

( 0 . 0002 ) 


0 . 001 *** 

(0.0004) 


0 . 0001 ** 

(0.00003) 


0 . 001 *** 

(0.0003) 


0 . 001 *** 

(0.0003) 


0 . 0001 *** 

(0.00003) 


F-statisties 


12.8 


6.8 


6.8 


4.2 


3.9 


3.9 


Rejeet null of no endogeneity 
bias at 5% level 


Yes 


Yes 


Yes 


Yes 


Yes 


Yes 


N 


54,554 


32,668 


32,668 


54,554 


32,668 


32,668 



The Wal-Mart variable is the ehange in the nurnber of stores per person, and the dependent variable is the ehange in employment per 
person. The eoeffieients measure the effeet of one Wal-Mart store opening on eounty-level retail employment. The instrumental 
variables are interaetions between durnmy variables for years and dumrny variables for eounties with eentroids with a radius of 100 
miles frorn Benton County, Arkansas, 101-200 rniles, 201-300 rniles, ete., out to 1800 rniles. All of the radii except the last eover a 100 
mile range; the last eovers 1601-1800 miles, beeause there is only one eounty beyond the 1700 mile radius. All speeifieations inelude 
year fixed effeets, and the speeifieations in eolumns (4)-(6) also inelude the eounty dumrny variables (in both the first and seeond stages). 
Standard eiTors (shown in parentheses below the estimates) and F-statisties are ealeulated elustering on state and year, and henee allow 
arbitrary eontemporaneous eoiTelations aeross eounties in a state. The sample period eovers 1977-1995. See notes to Table 1 for 
additional details. ‘**’, and ‘***’ indieate the estimate is statistieally signifieant at the 10-, 5-, or 1-pereent level, respeetively. The 
results for the test of the null hypothesis of no endogeneity bias are based on bias-eorreeted bootsttapped empirieal distributions for the 
differenee between the OLS and IV estimates, with the bootstrapping based on states and years rather than individual observations (the 
level of elustering used in the standard error ealeulations); this is preferable to the eonventional Flausman test [17] beeause OLS is 
ineffieient if there is either heteroseedastieity aeross eounties or autoeoiTelation within eounties. 





Table 5: Estimated Effeets of Wal-Mart Stores on Retail Payrolls ($ 1 ,()()()'. s) 





County-Specific Time Trends Excluded 
(Equation (1)) 


County-Specific Time Trends Ineluded 
(Equation (l’)) 




Agg. Retail 


Agg. Retail 


Gen. Mereh. 


Agg. Retail 


Agg. Retail 


Gen. Mereh. 




(1) 

A Sample 


(2) 

B Sample 


(3) 

B Sample 


(4) 

A Sample 


(5) 

B Sample 


(6) 

B Sample 


OLS 














Wal-Mart stores 


270.2*** 

(90.0) 


166.4 

(145.1) 


458.9*** 

(38.9) 


233.1*** 

(89.7) 


172.5 

(154.1) 


427.6*** 

(39.3) 


Total payrolls per person 


0.024*** 

(0.005) 


0.020*** 

(0.005) 


0.001*** 

(0.0003) 


0.023*** 

(0.005) 


0.020*** 

(0.006) 


0.002*** 

(0.0003) 


Adjusted R 2 


0.110 


0.151 


0.067 


0.094 


0.123 


0.039 


IVfor Wal-Mart stores 














Wal-Mart stores 


-1,711.4** 

(778.1) 


-1,669.8* 

(912.6) 


684.3*** 

(132.2) 


-1,149.8 

(735.8) 


-1,547.6* 

(894.4) 


525.4*** 

(149.2) 


Total payrolls per person 


0.023*** 

(0.005) 


0.020*** 

(0.005) 


0.001*** 

(0.0003) 


0.023*** 

(0.005) 


0.020*** 

(0.006) 


0.002*** 

(0.0003) 


Rejeet null of no endogeneity 
bias at 5% level 


Yes 


Yes 


Yes 


Yes 


Yes 


No 


N 


54,554 


32,668 


32,668 


54,554 


32,668 


32,668 



The notes from Table 4 apply. The only differenee is that the dependent variable is the ehange in retail payrolls per person, so the 
eoeffieients now measure the effeet of one Wal-Mart store opening on eounty-level retail payrolls (measured in units of thousands of 
1999 eonstant dollars). The F-statisties for the first stage are as reported in Table 4. 





Table 6: Assessments of Identifieation 





County-Specific 
Time Trends Excluded 
(Equation (1)) 


County-Specific Tirne 
Trends Ineluded 
(Equation (l’)) 




Retail Employment 


Retail Employment 




(1) 


(2) 


1. Table 4 estimates, retail employment 


-166.98*** 


-146.35*** 




(48.83) 


(47.77) 


Falsifieation test, manufaeturing employment 






2. IV [first-stage F-statistic] 


160.74** 


-60.45 




(80.67) 


(73.05) 




[12.8] 


[4.2] 


Restrieted IV’s isolating Wal-Mart entry 






3. Instrument with year-distanee interaetions for pairs satisfying 


-190 47*** 


-97.15* 


(i) 1982 or earlier and within 500 rniles of Benton County, 


(59.74) 


(54.45) 


and (ii) 1992 or later and 1,000 miles or rnore frorn Benton 


[7.7] 


[7.8] 


County [first-stage F-statistic], retail employment 






Exclude eounties near Benton County 






4. Exclude eounties within 100 rniles of Benton County 


-179 27*** 


-159.34*** 


(N=53,716) 


(52.59) 


(50.86) 


5. Exclude eounties within 200 rniles of Benton County 


-197.59*** 


-177.03*** 


(N=5 1,539) 


(58.22) 


(56.80) 


6. Exclude eounties within 300 rniles of Benton County 


-207.38*** 


-185.66*** 


(N=47,321) 


(64.84) 


(64.31) 



Table reports IV estimates of eoeffieient on ehange in nurnber of Wal-Mart stores per person. See notes to Tables 1 and 4. 
All estimates are for A sample. Sample sizes are the sarne as in Table 4. Estimated eoeffieients of total payroll per person 
variable, where ineluded, are not shown. 





Table 7: Robustness Analyses 





County-Specific 
Time Trends Excluded 


County-Specific Tirne 
Trends Ineluded 




(Equation (1)) 


(Equation (l’)) 




Retail Employment 
(1) 


Retail Employment 
(2) 


1. Table 4 estimates 


-166.98*** 


-146.35*** 




(48.83) 


(47.77) 


2. Cluster on state only 


-166.98*** 


-146.35*** 




(55.94) 


(49.28) 


3. Subtraet retail payroll out of total payroll per person eontrol 


-174.07*** 


-153.74*** 




(45.07) 


(49.57) 


4. Ornit total payroll per person eontiol 


-181.58*** 


-162.07*** 




(46.12) 


(51.23) 


5. Drop eounties that never have a store (N=30,957) 


-94.40*** 


-103.10*** 




(30.63) 


(34.95) 


6. Drop eounties with elosed stores 


-166.68*** 


-144.08*** 


(N=53,746) 


(43.15) 


(47.19) 


7. Through 2002 


-184.84*** 


-215.29*** 


(N=75,482) 


(41.27) 


(59.53) 


8. Drop eounties with Sam’s Clubs 


-150.47*** 


-130.88*** 


(N=49,031) 


(43.63) 


(47.31) 


9. Combine Wal-Mai‘t stores with Sam’s Clubs 


-161.44*** 


-175.96*** 




(42.72) 


(47.81) 


10. Weighted by store size 


-216.12*** 


-223.48*** 




(59.53) 


(62.41) 


11. Counties with above rnedian population (21,000) in 1977 


-81.13 


-105.00** 


(N=27,290) 


(57.58) 


(48.90) 


12. Counties with below rnedian population (21,000) in 1977 


-103.28* 


-90.03 


(N=27,264) 


(53.48) 


(59.70) 


13. Two-year differenees 


-196.48*** 


-209.61*** 


(N=5 1,429) 


(34.73) 


(48.49) 


14. Three-year differenees 


-206.03*** 


-224.35*** 


(N=48,362) 


(35.81) 


(45.24) 



Table reports IV estimates of eoeffieient on ehange in nurnber of Wal-Mart stores per person. See notes to Tables 1 and 4. 

All estimates are for A sample. Sample sizes are the same as in Table 4 unless otherwise noted. Estimated eoeffieients of 
total payroll per person variable, where ineluded, are not shown. For the estimates labeled “weighted by store size,” stores are 



weighted by their square footage (at the end of the sample period) divided by the average square footage for the sample, to 
maintain eomparability with the baseline estimates. The speeifieations using two- and three-year differenees use dumrny 
variables for the distinet two- or three-year periods over which the differenees are eomputed, rather than dumrny variables for 
single years. That way, the same dummy variables appear in the first-differeneed and levels regressions. 





